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Abstract. Constraint Programming (CP)[1] has been successfully applied to both 
constraint satisfaction and constraint optimization problems. A wide variety of 
specialized global constraints provide critical assistance in achieving a good model 
that can take advantage of the structure of the problem in the search for a solution. 
However, a key outstanding issue is the representation of 'ad-hoc' constraints that 
do not have an inherent combinatorial nature, and hence are not modelled well 
using narrowly specialized global constraints. We attempt to address this issue by 
considering a hybrid of search and compilation. Specifically we suggest the use 
of Reduced Ordered Multi- Valued Decision Diagrams (ROMDDs) as the sup- 
porting data structure for a generic global constraint. We give an algorithm for 
maintaining generalized arc consistency (GAC) on this constraint that amortizes 
the cost of the GAC computation over a root-to-leaf path in the search tree with- 
out requiring asymptotically more space than used for the MDD. Furthermore we 
present an approach for incrementally maintaining the reduced property of the 
MDD during the search, and show how this can be used for providing domain 
entailment detection. Finally we discuss how to apply our approach to other sim- 
ilar data structures such as AOMDDs and Case DAGs. The technique used can 
be seen as an extension of the GAC algorithm for the regular language constraint 
on finite length input [2]. 



1 Introduction 

Constraint Programming (CP)[1] is a powerful technique for specifying Constraint Sat- 
isfaction Problems (CSPs) based on allowing a constraint programmer to model prob- 
lems in terms of high-level constraints. Using such global constraints allows easier 
specification of problems but also allows for faster solvers that take advantage of the 
structure in the problem. 

The classical approach to CSP solving is to explore the search tree of all possible 
assignments to the variables in a depth-first search backtracking manner, guided by 
various heuristics, until a solution is found or proven not to exist. One of the most basic 
techniques for reducing the number of search tree nodes explored is to perform domain 
propagation at each node. In order to get as much domain propagation as possible we 
wish for each constraint to remove from the variable domains all values that cannot 
participate in a solution to that constraint. This property is known as Generalized Arc 
Consistency (GAC). 



It is only possible to achieve GAC for some types of global constraints in practice, as 
a global constraint can model NP-hard problems making it infeasible to achieve GAC. 
The use of global constraints can significantly reduce the total number of constraints in 
the model, which again improves domain propagation if GAC or other powerful types of 
consistency can be enforced. However, in typical CSPs there are many constraints that 
he outside the domain of the current global constraints. Such constraints are typically 
represented as a conjunction of simple logical constraints or stored in tabular form. 
The former can potentially reduce the amount of domain propagation, while the tabular 
constraints typically takes up too much space for all but the most simple constraints, 
and hence it is also computationally expensive to achieve GAC. 

The aim of this paper is to introduce a new generic global constraint type for con- 
straints on finite domains based on the approach of compiling an explicit, but com- 
pressed, representation of the solution space of as many constraints as possible. To this 
end we suggest the use of Multi- Valued Decision Diagrams (MDDs). 

It is already known how to perform GAC in Unear, or nearly Unear time in the size 
of the decision diagram for many types of decision diagrams including MDDs [3-5]. 
However, compact as decision diagrams may be, they are still of exponential size in the 
number of variables in the worst case. In practice their size is also the main concern, 
even when they do not exhibit worst case behavior. Applying the static GAC algorithms 
at every step of the search is therefore Ukely to cause an unacceptable overhead in many 
cases. 

To avoid the overhead it is essential to avoid repeating computation from scratch at 
each step and instead use an algorithm that amortizes the cost of the GAC computation 
over a number of domain propagation steps. In this paper we introduce such an algo- 
rithm using ideas from [2]. The paper is structured as follows. In Section 1.1 we discuss 
in some detail how are our contributions are related to those contained in [2] as well as 
briefly describe other related results. In Section 2 we describe the functionaUty required 
from the global constraint we are presenting and the setup in which it can be used. In 
Section 3 we describe how the technique from [2] can be applied to a simplified MDD. 
We then move on to show how we can extend that approach to handle long edges in 
MDDs in Section 4. In Section 5 we present results on maintaining the reduced prop- 
erty of the constraint during a search. Based on this we show how domain entailment 
can be detected in Section 5.3. Finally we discuss how our approach can be apphed to 
other data structures that are similar to MDDs, such as AOMDDs and Case DAGs in 
Section 7. 

1.1 Related Work 

The concept of compiling an exphcit, but compact, representation of the solution space 
of a set of constraints has previously been applied to obtain backtrack-free configurators 
for many practical configuration problems [6]. In this case Binary Decision Diagrams 
(BDDs) [3] are used for representing the solution space. However, it is well known that 
BDDs (and MDDs) are not capable of efficiently representing prominent constraints 
such as the AllDifferent constraint [7]. 

A regular language constraint for finite sequences of variables is introduced in [2]. 
It uses a DFA to represent the vaUd inputs where the input is hmited to be of length 



n. Since the constraint considers a finite number of inputs, these can be mapped to n 
variables, and GAC can be enforced according to the domains of these variables. To this 
end the cycles in the DFA are 'unfolded' by taking advantage of the fact that the input 
is of a finite length. The resulting data structure has size 0{ndmaxq) where q is the 
number of states in the original DFA and d„iax the size of the largest variable domain. 
A bounded incremental GAC algorithm with complexity Unear in the number of data 
structure changes is also presented. Since this regular global constraint is defined on a 
finite length input, it can be used as a generic constraint. In fact, we note that there is a 
strong correspondence between the unfolded DFA and a simplified MDD representing 
the same constraint. However, there are some important theoretical and practical reasons 
for using fully reduced MDDs when the goal is a generic global constraint. Below we 
sunmiarize our contributions and highhght the differences compared to using the regular 
constraint in the role of a generic global constraint. 

- Firstly, DFAs do not allow skipping inputs, even for states where the next input 
is irrelevant. Skipping input variables in this manner is part of the reduce steps 
for BDDs, and if it is used in MDDs it requires alterations to the GAC algorithm. 
We give a modified algorithm to handle this. In some cases allowing the decision 
diagram to skip variables can give a significant reduction in size. A very simple 
example is a constraint specifying that the value v must occur at least once for one 
of the variables xi, . . . ,Xn each having the same domain of size d. In an MDD that 
does not allow skipped variables (such as an unfolded DFA) this requires Q{rP'd^) 
nodes compared to 0{nd) nodes if we allow skipped variables in the MDD. 

- Secondly, BDDs are normally kept reduced during operations on the BDDs. This 
allows subsequent operations to run faster and also shows directly if the result is 
the constant true function. We present an approach that can dynamically reduce the 
MDD without resorting to scanning the entire live part of the data structure, and 
which also allows us to detect domain entailment [8]. Domain entailment detection 
can be critical for the performance of a CSP solver, as it can save a potentially 
exponential number of exectutions of the GAC algorithm. The technique described 
in [2] does not provide dynamic reduction or domain entailment detection. 

The suggestion in [2] is to minimize the DFA only once at the beginning (thereby 
also minimizing the initial 'unfolded' DFA) which would correspond to reducing 
the MDD prior to the search, and does not provide any form of entailment detection. 
The problem of efficient dynamic minimization is not discussed in [2] and would 
seem to require a technique similar to the one we present in this paper for obtaining 
dynamic reduction of the MDD constraint. 

- Thirdly, we cover how the GAC algorithm can be adapted to operate on other deci- 
sion diagram style data structures such as AOMDDs [4] and Case DAGs [9]. 

Finally, from a practical perspective it is not a good idea to first construct a DFA 
and then unfold it. It is more efficient to use a BDD package to construct an ROBDD 
directly, as efficient BDD packages [10, 1 1] with a focus on optimizing the construction 
phase have already been developed driven by needs in formal verification [12]. Specif- 
ically the use of BDDs for the construction gives access to the extensive work done on 
variable ordering (see for example [13-15]) for BDDs. Once an ROBDD is constructed 
it can then easily be converted into the desired MDD. 



Another related result is [16] in which it is discussed how to maintain Generalized 
Arc Consistency in a generic global constraint on binary variables based on a BDD. 
Their technique differs from the straightforward DFS scarming technique by using 
shared good/no-good recording and a simple cut-off technique to in some cases reduce 
the amount of nodes visited in a scan. Their technique can be adapted for non-binary 
variables, but the cut-off technique lose merit if the MDD is to be reduced dynamically 
and becomes much less efficient as the domains increase in size. Hence their techniques 
do not apply when the intention is to collect sets of small constraints into a few larger 
MDD constraints. 

1.2 Notation 

In this paper we consider a CSP problem CP(X, D,F), where X = {xi, . . . , a;„} is the 
set of variables, F the set of constraints on the variables in X and D = {Di, .... 
is the multi-set of variable domains, such that the domain of a variable Xi is Dj,. We use 
di = \Di\ to denote the size of domains and dmax = max{c?i | Xi G X} to denote the 
largest domain. A single assignment is a pair {xi,v) where Xi G X and v G Di. The 
assignment (xj, v) is said to have support in a constraint F^, iff there exists a solution to 
Ffe where Xi is assigned v. If a single assignment (xi, v) for which v G Di has support 
in a constraint Fj, v is said to be in the valid domain for Xi respective to Fj , denoted 
VDj(i^j). If for all variables Xi and a constraint Fj it is the case that Di = YDi{Fj) 
then Fj is said to fulfill the property of Generalized Arc Consistency (GAC). A partial 
assignment p is a set of single assignments to distinct variables, and difull assignment 
is a partial assignment that assigns values to all variables. 

We now define an MDD constraint belonging to a specific CSP. 

Definition 1 (Ordered Multi- Valued Decision Diagram (OMDD)). An Ordered Multi- 
Valued Decision Diagram ( OMDD or just MDD) for a CSP CP is a layered Directed 
Acyclic MultiGraph G{V, E) with n + \ layers (some of which may be empty). Each 
node u has a label l{u) G {1, . . . , n -I- 1} corresponding to the layer in which the node 
is placed, and each edge e outgoing from layer i has a value label v{e) G Di. Further- 
more we use s(e) and d{e) to denote respectively the source and destination layer of 
each edge e . 

The following restrictions apply: 

- There is exactly one node u such that l{u) — min{Z(g) | q G V} denoted root. 

- There is exactly one node u such that l{u) = n-\- 1 denoted terminal. 

- For any node u, all outgoing edges from u have distinct labels. 

- All nodes except terminal has at least one outgoing edge. 

- For all e £ E it is the case that s{e) < d{e). 

We will use Vi = {u & V \ l{u) = i} to denote the nodes of layer i and E^ = 
{e G E \ s{e) — i} to denote the set of edges originating from layer i. Furthermore we 
define 

Pu = {(p, v)\3e = {p, u) eE : v{e) = v} 

and 

Cu = {(c, v)\3e = {u, c) eE: v{e) = v} 



That is, Pu corresponds to the incoming edges to u, and C„ corresponds to the 
outgoing edges of u. Given layers i > j > k,we say that i is a later layer than j and 
that k is an earlier layer than j. 

Definition 2 (Solution to an MDD). A full assignment p is a solution to a given MDD 
iff there exists a path Q = (ei , . . . , ) from root to terminal such that for each (xj , v) € 
p at least one of the following conditions hold: 

- 3e € Q such that s{e) = i and v{e) = v 

- l{root) > i 

- 3e G Q such that s(e) <i < d{e). 

An edge e such that s(e) + l < d{e), as in the third condition in the above definition, 
is called a long edge and is said to skip layer .s(e) + 1 to d{e) — 1. It it is worth noting 
that one long edge can represent many partial assignments. 

Definition 3 (Reduced OMDD). An MDD is called uniqueness reduced iff for any two 

distinct nodes u-i,U2 at any layer i it is the case that ^ C„2. If it is furthermore 

the case for all layers i G {1, .... n} that no node ui in layer i exists with di outgoing 
edges to the same node U2, the MDD is said to be fully reduced. 

The above definitions are just the straightforward extension of the similar properties 
of BDDs[3] . Fully reduced MDDs retain the canonicity property of reduced BDDs, that 
is, for each ordering of the variables there is exactly one fully reduced MDD for each 
Boolean constraint on n discrete domain variables. An example reduced MDD is shown 
in Figure 1 . 
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Fig. 1. The above figure shows an example MDD that is fully reduced. Assuming Di = 
{1, 2, 3, 4} and D2 = {1, 2, 3}, it represents the binary constraint with solutions { (1,3),(2,1), 
(3,3),(4,1),(4,2),(4,3) }. 



2 Searching with an MDD 



In this paper we consider a backtracking search for a solution to a conjunction of con- 
straints, at least one of which is an MDD. To simplify the complexity analysis, we as- 
sume that the search branches on the domain values of each variable in some specified 
order and that full domain propagation takes place after each branching. The process of 
branching and performing full domain propagation we will refer to as a phase. For our 
proposed global constraint we refer to performing domain propagation on it as a single 
step. As such, one phase may contain many steps depending on how many iterations 
it takes until none of the constraints are able to remove any further domain values. In 
order to be useful in this type of CSP search, an implementation of the MDD constraint 
needs to supply the following functionaUty: 

- ASSIGN(x,z;) 

- REMOVE((a;i, ■yi), . . . , {xk,Vk)) 

- BacktrackO 

The Assign operation restricts the valid domain of x to v and is used to perform 
branchings. The REMOVE operation corresponds to domain restrictions occurring due 
to domain propagation in the other constraints. The BACKTRACK operations undoes the 
last Assign operation and all Remove operations that has occurred since, effectively 
backtracking one phase in the search tree. For the implementation of BACKTRACK we 
will simply push data structure changes on a stack, so that they can be reversed easily 
when a backtrack is requested. This very simple method ensures that a backtrack can 
be performed in time linear in the number of data structure changes made in the last 
phase. For all the dynamic data structures considered in this paper the space used for 
this undo stack will be asymptotically bounded by the time used over a root-to-leaf path 
in the search tree. Furthermore, in all the cases studied in this paper, AssiGNCxi, v) is 
just as efficiently implemented as a single call to REMOVE({(xi, Vj) \ vj € Di\ {«}}). 
Therefore we will only discuss the implementation of Remove. 

3 Calculating the change in valid domains 

In the following we will say that an edge e is vaUd iff there exists a path in the MDD, 
corresponding to a solution to the MDD under all restrictions applied so far, such that e 
is part of the path. We furthermore say that a node is vaUd if it is root or has at least one 
vaUd incoming and outgoing edge. A crucial element in computing the vaUd domains in 
an MDD is that of a supporting edge. A valid edge e E E supports a single assignment 
{xi, v), if s(e) = i and v{e) = ti or if s(e) < i < d{e). Note that v G Di iff there exists 
an edge e supporting the assignment (xt , v). We say that a node u supports a value v if 
there exists an edge (u, c) supporting (xn^^, v) for some c. 

In this section we will show how the approach from [2] can be used for maintaining 
the valid domains of a simplified MDD. We will temporarily assume that the OMDD 
we operate on is initially Uniqueness Reduced, but not fully reduced, in fact we will 
assume that all outgoing nodes from a node in layer i lead to nodes in layer i + 1 (and 
hence also that l{root) = 1). 



3.1 Tracking support 



Valid domains are maintained by tracking the loss of supporting edges for each possible 
assignment as restrictions of the form Xi ^ v aie applied. The application of a restric- 
tion Xi V immediatly invalidates all edges originating from layer i with label v. We 
need to compute what further supporting edges are invalidated as a consequence and 
determine if any assignments have lost their supporting edges. 

We track the set of supporting edges by storing a set of sets S, such that for every 
possible single assignment (xi,v) where Xi E X and v E Di there exists a set y € S 
containing all the nodes and the corresponding edges that gives support to the single 
assignment (xi.v). In maintaining S we learn immediately when a single assignment 
{xi, v) no longer has support, as „ will be empty. Note that the space needed for the 
support lists is only 0(|i?|). The approach to maintaining S is very simple. To apply 
a restriction Xi ^ v, we first invalidate all edges supporting this assignment. We then 
check if this has made any nodes invalid, and if so invalidate their edges, continuing 
recursively. 

3.2 Performing Remove 

The above support tracking technique is implemented as the procedures REMOVE and 
RemoveEdge shown in Figure 2. The remove operation is performed by, for each sin- 
gle assignment {xi , v) to be removed, visiting all nodes that offer support for (xi,v). 
On each such node the update procedure RemoveEdge is used to remove the corre- 
sponding invalidated edge while maintaining S and the valid domains. 

3.3 Complexity 

The RemoveEdge operation can easily be supported in 0(1) time per call using 
0{\E\) space using just linked lists for the support sets and pointers from each edge 
to their corresponding support list entry. 

Consider a path in the search tree implicitly represented by the branching search. 
We will here and in the following describe the complexity of the presented algorithms 
as the complexity over such a root-to-leaf path in the search tree. Since each edge can 
only be removed once during this part of the search we obtain the following result: 

Lemma 1. Consider any root-to-leaf path in the search tree. Then the total number of 
calls to RemoveEdge is at most 0{\E\). 

4 Skipping input variables 

We have so far obtained a very efficient GAG algorithm for a simplified MDD data 
structure by simply applying the technique from [2] to our MDD constraint. However, 
the algorithm described so far does not allow the MDD to be fully reduced. Specifically 
it does not take long edges or the possibility that the root is not placed in layer 1 into 
consideration. If the root is in a layer z > 1, we will simply add a long edge that skips 
layer 1 to i — 1, such that we only need to deal with long edges. 



4.1 Handling long edges 

For the purpose of valid domains computation, the important observation is that if a 
valid long edge exists skipping layer i to j, then all available domain values for the 
corresponding variables are also supported. For static valid domains computation [5] 
these variables are found by for each level finding the longest outgoing edge. Given 
this information it is simple to list the variables which have full support due to a long 
edge skipping over their layer in time 0{n). We obtain a dynamic equivalent of this 
algorithm by maintaining the length of the longest long edges originating from each 
level. Additionally each long edge e is placed in the support list of {xs(e) , v{e)), as if 
it was a regular edge. In order to track the longest outgoing edge of each level we will 
maintain the set L of distinct intervals [i, j] such that there exists at least one long edge 
skipping layers itoj. In order to detect when a skipped interval should be removed from 
L, we for each skipped interval [i, j] G L maintain a counter lij, giving the number of 
long edges skipping exactly the layers i to j. A long edge e skipping layer i to j can 
no longer support the skipping of the interval iff e is invalidated. In this case 
is decremented, and in case kj = the skipped interval [i, j] is removed from L. Note 
that we do not perform any update if a Remove call actually 'cuts' a long edge into two 
parts. This means that we allow the intervals to support values that are invalid. However, 
the only invalid values supported in this way are those that are given as arguments to 
Remove, and have therefore already been removed from the valid domain values by 
another constraint. 

Recall that our goal is to maintain the longest outgoing edge from each layer in the 
MDD. Using the above we can do this simply by maintaining a max-priority queue for 
each level i over the the set of skipped intervals of the form [i, j], for some j, using j 
as key. Assuming the priority queue supports reporting the maximum in time 0(1) we 
can obtain the required table in 0{n) time. It should now also be clear why we build 
the priority queue over the distinct intervals instead of all the intervals represented by 
long edges. Had we used the long edges directly, all edge deletions would require a 
delete operation on the priority queue, while we now only need a delete operation when 
a distinct interval no longer has any supporting long edges. The algorithm is given in 
Figure 2 and the following lemma gives its complexity. 

Lemma 2. On a root-to-leaf in the search tree the complexity of the longest-outgoing- 
edge based approach is bounded by 0{\E\ + Ig Ig n + ri^dmax)- 

Proof. As previously, the actual time for handling normal edges is at most O ( | -B | ) . Each 
interval can only be removed once, and each such deletion costs 0(lglg using a 
VEB-based priority queue [17]. Finally we spent 0{n) time per step to compute the 
table of longest outgoing edges and compute the variables covered by long edges. In 
total this yields a complexity over a root-to-leaf path in the search tree of 0(|i?| + 
\L\ Iglg \L\ -h tn), where t is the number of steps. Since there can at most be O(n^) 
distinct long edge intervals and ndmax steps this yields 0{\E\ -\- \g\gn -\- n^dmax)- 

We believe that for most practical applications of the MDD constraint the above 
complexity will be completely dominated by the \E\ factor, and hence that the addi- 
tion of long edges result in no significant performance impact. As an alternative so- 
lution we can use the dynamic interval union data structure (DIU) presented in [18] 



Init() 

1 Initialize each support list 



2 L ^ the set of intervals skipped by at least one long edge 

3 hj ^ the number of long edges skipping exactly layer i to j 

4 Df ^ the values for Xi supported by the support lists 

5 [/^ {!,..., n} 



6 Remove(0) > Initial domain propagation 

REMOVE(i?) 

1 for each {xi,v) G R 

2 do for each (u, c) £ Si^v 

3 RemoveEdge(u, c, v) 

4 U' -f— union of the intervals in L 

5 for each l€U\U' 

6 do L», = Df 
1 ifA=0 

8 then Constraint failed 

9 ?7' 

RemoveEdge(m, c, 

1 \ {('^i '^)} 

2 if s;(u) „ = > (xi{u),v) no longer supported in support Usts 

3 thenDf(„) ^D,=(„)\M 

4 a^C„\{(c,w)} 

5 P. ^P.\{(h,^')} 

6 if i(c) — > 1 > If the removed edge is a long edge 



7 then decrement Z;(„)+i,((c)-i 

8 if = 

9 then L^L\ {[l{u) + 1, l{c) - 1]} 

10 if Cu = > w has no outgoing edges 

1 1 then for each {p, v') G Pu > Remove incoming edges 

1 2 do RemoveEdge(p, u,v') 

13 if Pc = t> w has no incoming edges 

14 then for each {c',v') € Cc > Remove outgoing edges 

15 do RemoveEdge(c, c',v') 



> Remove edges supporting {xi,v) 
> Layer I is no longer skipped by a long edge 



Fig. 2. Remove calls RemoveEdge for each edge that must be removed due the restrictions 
in R. Afterwards it recomputes the layers supported by long edges L and updates the domains 
accordingly. RemoveEdge takes as input an invalid edge in form of the source node u, the 
destination node c and the corresponding value label v. It then moves from the invalid edge 
downwards in depth first manner as long as there are nodes being invalidated due to lacking vaUd 
incoming edges. If the node u has no more outgoing edges it propagates upwards, removing nodes 
that have no more outgoing edges. Recall that P„ corresponds to the incoming edges of u and Cu 
to the outgoing edges. For each edge it removes it decrements the counter of the corresponding 
skipped interval and updates L if needed. 



to store the intervals. Using this data structure it is possible to obtain a complexity 
of 0{\E\ + n^lgn + ndmax) [19]. However for practical applications, the longest- 
outgoing-edge approach using a simple binary heap is most likely preferable (yielding 
0{\E\ + Ig n + ri^dmax)) due to the simple implementation and low overhead. 

5 Maintaining the reducedness property 

In the above we do not take steps to maintain the uniqueness reduced property of the 
MDD when we update the data structure. This forfeits a chance for a large speed-up. 
If a reduction at an early search node s would lead to large reduction in the size of the 
data structure all descendant search node of ,s (of which there can be an exponential 
number) would benefit from working on a much smaller data structure. As an example 
of the effect of dynamic reduction consider the simple constraint / encoding the rule 
Xi < X2,xi < .T3 . . . ,xi < Xj with domains Di = {1, . . . ,k} for some constant k. 
Let MDD{f ) be the MDD representing / and let fv = f A xi = v. Now consider 
the removal of the value 1 from the domain X2, - ■ ■ ,Xj (as could be induced by an ex- 
ternal AUDifferent constraint). With this restriction MDD( fi) becomes equivalent to 
MDD{f2) and can be merged, reducing the size of the MDD required to store the con- 
straint with a constant factor. If the value 2 is lost next then a further constant fraction of 
the MDD can be removed due to the reduce step as /i = /2 now becomes equivalent to 
/3. This is of course a very simplistic constraint easily propagated using other methods, 
but if we consider the conjunction of the constraint with another constraint, the exam- 
ple still applies in many cases, especially if the new constraint does not depend on the 
value of xx . One example of such an additional constraint is Vi G [2, j — 1] : Xi 7^ Xi+i . 
Note that if we ensure the uniqueness reduced property the MDD will be fully reduced 
according to the original domains throughout the search (assuming it is fully reduced 
initially), since there is no risk that new long edges will be created when we only per- 
form domain restrictions. We therefore first discuss how to ensure the uniqueness re- 
duced property and then describe the addition necessary in order to obtain full reduction 
according to the current domains. 

5.1 Dynamic reduction 

Assuming that the MDD is uniqueness reduced for all layers later than layer i, a node 
u in layer i becomes redundant iff at least one of its outgoing edges are modified such 
that Cu = Cq for some node q in layer i. This redundancy can be removed by merg- 
ing the redundant nodes. Two redundant nodes u, q are merged by removing u and 
redirecting all edges ending in u to g or vice versa. In the first case we say that u is 
the subsumee, and q the subsumer. Note that a merger can give rise to redundancies in 
earlier layers, but not in later layers. We will therefore dynamically reduce the MDD 
by maintaining maintain a set of 'dirty' nodes that need to be checked for redundancy 
during the operation of REMOVE. Afterwards we check the dirty nodes for redundancy 
in a bottom-up manner, ensuring that later layers are uniqueness reduced, before earlier 
layers are considered. 



Redundancy detection In order to efficiently check whether a given node has become 
redundant we will for each node maintain a hash signature computed based on the 
node's level and outgoing edges. Using this signature and a hashtable we can discover 
nodes that become redundant. Some care must be take to ensure that the signature can 
be updated in 0(1) time when an edge changes. This can be ensured using a variation 
of vector hashing [20], which we describe in [19]. We would like to stress that vector 
hashing has a very low overhead, and has been shown to be competitive in practice. 
Additionally, to ensure that collisions in the hash table do not give rise to expensive 
comparisons between the outgoing edges of two nodes, we utilize the technique of 
generating a hash signature longer than necessary to index the hash table. Using this 
technique nodes need only to be compared when their full signature matches, even 
though they reside in the same bucket in the hash table. In this way it is possible to 
ensure that inserting a node which is not redundant takes expected time 0(1) and that 
insertion of a redundant node u takes expected time 0(|C„|). A detailed description 
can be found in [19]. 

Merging nodes Given two nodes ui and U2 to merge we always designate the one 
with the largest number of parents as the subsumer in order to reduce the total cost of 
the merge operations. An edge e is only redirected when its end-point c is subsumed. 
Since this only happens when another node of larger in-degree becomes identical to c 
the in-degree required to cause e to be redirected must at least double each time e is 
moved. Hence an edge can only be moved [lg(|y |)] times as \V\ is an upper bound on 
the in-degree of a node. Note that this is a very simple and classic greedy strategy that 
incurs no significant overhead. 

Complexity of dynamic reduction 

Lemma 3. The expected time spent over a root-to-leaf path in the search tree by RE- 
MOVE on reducing the MDD is 0{\E\\g\V\). 

Proof. Since each edge can only be redirected [lg( | F | )] times the total cost of redirect- 
ing edges is O ( I i? I Ig I y I ) . Checking and discovering a redundant node u and removing 
its outgoing edges takes time 0(|C„ |). Note that each end-point of the deleted edges 
must have an in-degree of at least two before the merger, and therefore no further edges 
will need to be removed. This takes time at most 0{\E\) as edges cannot be more than 
once over a root-to-leaf path in the search. Checking a node that is not redundant takes 
expected time 0(1), and occurs only once per redirection or removal of an edge. The 
total expected time for this is therefore 0(|i?| Ig |y |), by the bound on the number of 
edge redirections. 

5.2 Full reduction based on current domains 

The reduce step described above keeps the MDD fully reduced according to the original 
domains. This means that while the MDD is uniqueness reduced it is not fully reduced 
according to the current domains. As an example consider an MDD with 1 variable 



xi and a single node ui with edges labelled 1 and 2 going to terminal. If the domain 
of x\ is {1, 2, 3} this MDD is fully reduced, while it reduces to the terminal node if 
the domain is {1,2}. Let v{u) to denote the set of values supported by u, then full 
reduction according to the current domains can be achieved by applying the following 
rule: If there exists a node u for which v{u) = such that all outgoing edges from 
u have the same endpoint we wiU consider it redundant and reduce it into a long edge. 
We observe that it is now possible for a node u to become redundant without having its 
outgoing edges modified: Consider a node u in layer i such that all its outgoing edges 
lead to the same node and and v{u) = Di \ {v}, where v e Z)j. Should v be removed 
from Di through a restriction, u will now be redundant. In order to detect this efficiently 
we will keep track of the set of nodes = {u G V | 3c G V : V(c', u) G C„ : c' = c}, 
as these nodes are the only candidates for being reduced using the new reduction rule. 
Since u must either be in Vr initially or enter Vr as children of u are merged, we can 
maintain Vr during merge operations. This is easily done in 0(1) per edge modification 
and thus do not affect the asymptotic complexity. 

When a domain is modified we need to find and reduce all nodes u € Vr for 
which v{u) = Di. This can be done efficiently by creating a hash table that maps 
nodes in Vr, using v{u) as key. Should the domain Di be modified, reducible nodes 
can be found simply by looking up Di in the hash table. This lookup yields all the 
nodes of Vr for which v{u) = Di, which are exactly those nodes for which the new 
reduction rule applies. The hash signature of the nodes and domains can be maintained 
in 0(1) per edge or domain modification using vector hashing. The space used for this 
is insignificant compared to the space required to store the edges of the MDD since it 
is only a small subset of V that is inserted into the table. Furthermore, since a node can 
only be removed by a reduction once over a root-to-leaf path in the search tree this does 
not affect the amortized complexity of the previous reduction technique. 



5.3 Domain entailment detection 

Given a constraint Fk, and a partial assignment p, let solp{Fk) be the set of vectors 
of domain values corresponding to solutions allowed by Fk that are consistent with 
p. A constraint Fk is said to be domain entailed under domains D iff Xi<i<„Dj C 
soZp(Ffe)[8]. That is, if all possible solutions to the CSP based on the available domains 
will be accepted by Fk, then Fk is entailed by the constraints implicit in the domains. It 
is beneficial to be able to detect domain entailment as it allows the solver to disregard 
the entailed constraint until it backtracks through the search node where the constraint 
was first entailed. If an MDD is kept fully reduced according to the current domains it is 
entirely trivial to detect domain entailment as an domain entailed MDD will be reduced 
to the terminal node. If the MDD is only kept uniqueness reduced and is domain entailed 
it is easy to see that it will consist of a path of up to n+ 1 nodes. Note that this state of the 
MDD is both necessary and sufficient for domain entailment assuming that the MDD 
has performed the most recent domain propagation step. Hence, checking for domain 
entailment just require that we maintain a node count for each level. If all node counts 
are one or less, and the constraint has not failed, the constraint is domain entailed. 



6 Constructing the MDD 



In order to apply our approach we first need to construct the MDD and compute the 
necessary auxiUary data structures. The input to constructing the MDD is assumed to be 
a set of constraints expressed in discrete variable logic. For example, tabular constraints 
could be expressed as disjunction of tuples, while an AllDifferent could be expressed as 
V(xi ,Xj) € X^,i ^ j : Xi ^ Xj. We suggest to construct the MDD by first building the 
ROBDD of the component constraints using Ig di binary variables to represent domain 
values for Xj (see for example [6]). This allows utilization of the optimized ROBDD 
Ubraries available. Assuming that the binary variables encoding each domain variable 
are kept consecutive in the variable ordering it is trivial to construct the MDD from the 
HDD using time hnear in the resulting MDD. This also means that the many variable 
ordering heuristics available for BDDs, which can substantially reduce the size of the 
BDD, can be utilized as long as they are restricted to grouping the binary variables 
according to the corresponding domain variable. The initial data structures required can 
be computed using a simple DFS scan of the MDD. The time that is acceptable for 
the compilation phase (and therefore also the allowable size of intermediate and final 
MDDs) depend on whether the constraint system is to be solved once or whether it 
is used in for example a configurator where the solver is used repeatedly on the same 
constraint set (with different user assignments) to compute the valid domains [21]. One 
could easily specify a large set of constraints and incrementally combine them into 
fewer and fewer MDD constraints until a time or memory limit is reached and still gain 
the benefit of improved propagation. 

7 Application to other data structures 

The techniques covered in this paper can also be applied to some other data structures 
that represent a compilation of a solution space. In this section we briefly outline what 
can be achieved for two specific data structures which are very sinailar to ROMDDs. 

7.1 AND/OR Multi- Valued Decision Diagrams (AOMDDs) 

AOMDDs were introduced in [4], and from the perspective of a GAC algorithm can be 
thought of as introducing AND nodes into the MDD, such that each child of an AND 
node roots an AOMDD which scope is disjoint from its siblings. This data structure 
is potentiaUy more compact than an MDD. We note that the support list technique for 
maintaining the vaUd domains is also applicable to AOMDDs. The only change is that 
an AND node becomes invalid if it loses any of its outgoing edges as opposed to all 
outgoing edges for an OR node. Furthermore since an AOMDD is reduced in a manner 
similar to an ROBDD our approach for dealing with long edges and obtaining dynamic 
reduction can also be applied without substantial changes. 

7.2 Interval edges 

In an ordinary MDD each edge corresponds to a single domain value, except for long 
edges which in skipped layers represent the entire domain. It is quite natural to consider 



the generalization to edges that represent a subset of a domain. One particular useful 
generalization of edges is to let each edge correspond to some interval of the domain 
values. This approach is used in Case DAGs [9, 22] which resemble MDDs without long 
edges, but where edges represent disjoint intervals instead of single values. If we use the 
technique from Section 3 and construct support lists we will use an excessive amount of 
space and no longer be able to claim aO{\E\) complexity over a root-to-leaf search path 
as each edge can occur in up to d^ax support lists. This can be avoided by abandoning 
support lists and instead observing that interval edges in many ways are similar to long 
edges. By keeping track of the distinct intervals of values supported by at least one edge 
in each layer, computing the valid domains reduces to computing the union of this set of 
intervals. Based on this observation it is possible to achieve a complexity of 0{nd^^^ + 
\E\) for maintaining GAC over a root-to-leaf path in the search tree. The details can be 
found in [19]. This compares with a worst-case complexity of n{djnax\E\) for a single 
step using a naive DFS approach. Additionally our dynamic reduction technique is also 
applicable, the only non-trivial change being that interval edges must be updated if their 
beginning or end values are removed from the current domains. If we fail to perform 
this update an edge e = {u, c) for which l{e) = [vi + 1, ('2] will not be considered 
identical to an edge e' = {u, c) for which / (e) = [vi , V2] when vi is removed from the 
domain of 

8 Conclusion 

This paper introduced the ROMDD global constraint and thereby provided a possible 
solution to the problem of representing ad-hoc constraints in constraints satisfaction 
problems. In addition to providing an efficient GAC algorithm for this constraint, we 
have shown how to keep the underlying decision diagram reduced dynamically during 
search in an efficient manner that also allows efficient domain entailment detection. The 
MDD global constraint can be used to efficiently represent the solution space of a set of 
simpler constraints. Based on whether the constraint problem needs to be solved once or 
many times, the time allocated to including smaller constraints in the MDD constraint 
can be easily adjusted. Since the constraint uses a reduced decision diagram to represent 
the solution space of the constraint it can also be used to represent tabular constraints 
in a compressed manner while still allowing a complexity that relates to the size of the 
data structure and not the number of solutions stored. Finally we have demonstrated 
how our approach can be applied to the AOMDD and Case DAG data structures. 
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